Discovering Sequential Association Rules with Constraints and Time Lags in Multiple Sequences
Abstract
We present MOWCATL, an efficient method for mining frequent sequential association rules from multiple sequential data sets with a time lag between the occurrence of an antecedent sequence and the corresponding consequent sequence. This approach finds patterns in one or more sequences that precede the occurrence of patterns in other sequences, with respect to user-specified constraints. In addition to the traditional frequency and support constraints in sequential data mining, this approach uses separate antecedent and consequent inclusion constraints. Moreover, separate antecedent and consequent maximum window widths are used to specify the antecedent and consequent patterns that are separated by the maximum time lag. We use multiple time series drought risk management data to show that our approach can be effectively employed in real-life problems. The experimental results validate the superior performance of our method for efficiently finding relationships between global climatic episodes and local drought conditions. We also compare our new approach to existing methods and show how they complement each other to discover associations in a drought risk management decision support system.
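The interplay of the two window widths and the time lag can be illustrated with a minimal Python sketch. This is not the MOWCATL algorithm itself: it only scores one given rule on one merged event list, and it assumes that patterns are unordered sets of event types and that the lag is measured from the end of the antecedent occurrence to the start of the consequent occurrence. The function names, parameters, and event labels are hypothetical, chosen for illustration only.

```python
from typing import Dict, List, Set, Tuple

Event = Tuple[int, str]  # (timestamp, event type)


def find_occurrences(events: List[Event], pattern: Set[str],
                     max_width: int) -> List[Tuple[int, int]]:
    """Return (start, end) spans, at most one per start time, in which every
    event type in `pattern` is seen within `max_width` time units.
    Assumption: event order inside the window is ignored."""
    relevant = sorted((t, e) for t, e in events if e in pattern)
    spans = []
    for i, (start, _) in enumerate(relevant):
        if i > 0 and relevant[i - 1][0] == start:
            continue  # this start time was already handled
        seen = set()
        for t, e in relevant[i:]:
            if t - start >= max_width:
                break  # window width exceeded
            seen.add(e)
            if seen == pattern:
                spans.append((start, t))
                break
    return spans


def rule_stats(events: List[Event], antecedent: Set[str], consequent: Set[str],
               ante_width: int, cons_width: int, max_lag: int) -> Dict[str, float]:
    """Count antecedent occurrences and how many of them are followed by a
    consequent occurrence that starts within `max_lag` time units.
    Assumption: lag runs from antecedent end to consequent start."""
    ante = find_occurrences(events, antecedent, ante_width)
    cons = find_occurrences(events, consequent, cons_width)
    matched = sum(
        1 for _, a_end in ante
        if any(a_end < c_start <= a_end + max_lag for c_start, _ in cons)
    )
    confidence = matched / len(ante) if ante else 0.0
    return {"antecedent_count": len(ante), "rule_count": matched,
            "confidence": confidence}


# Hypothetical events merged from a climate sequence and a drought sequence.
events = [
    (1, "ElNino"), (2, "WarmSST"), (5, "Drought"),
    (10, "ElNino"), (11, "WarmSST"),
    (20, "WarmSST"), (21, "ElNino"), (23, "Drought"),
]
stats = rule_stats(events, {"ElNino", "WarmSST"}, {"Drought"},
                   ante_width=3, cons_width=1, max_lag=4)
print(stats)
```

In this toy data the antecedent occurs three times and is followed by the consequent within the lag twice, so the rule's confidence is 2/3. A real miner would also generate candidate patterns, apply the inclusion constraints, and prune by a frequency threshold, none of which is shown here.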
Similar resources
Discovering Representative Episodal Association Rules from Event Sequences Using Frequent Closed Episode Sets and Event Constraints
Discovering association rules from time-series data is an important data mining problem. The number of potential rules grows quickly as the number of items in the antecedent grows. It is therefore difficult for an expert to analyze the rules and identify the useful ones. An approach for generating representative association rules for transactions that uses only a subset of the set of frequent itemse...
A Dissertation Proposal: Associating and Predicting Episodes of Events in Multiple Time Series for Supporting Policy Decision Making
Abstract. Many business and scientific domains require the collection and analysis of sequences of events and time series data. Although statistical approaches have long been applied to time series, most of these approaches assume the time series is stationary and typically must be applied globally to the sequence. Thus, other methods are needed to solve many types of problems that occur in seq...
Discovering Active and Profitable Patterns with RFM (Recency, Frequency and Monetary) Sequential Pattern Mining – A Constraint Based Approach
Sequential pattern mining is an extension of association rule mining that discovers time-related behaviors in a sequence database. It extends association rule mining by adding time to the transactions. The problem of finding association rules is concerned with intra-transaction patterns, whereas sequential pattern mining is concerned with inter-transaction patterns. Generalized Sequential Pattern (GSP) mining a...
CMRules: Mining sequential rules common to several sequences
Sequential rule mining is an important data mining task used in a wide range of applications. However, current algorithms for discovering sequential rules common to several sequences use very restrictive definitions of sequential rules, which make them unable to recognize that similar rules can describe the same phenomenon. This can have many undesirable effects such as (1) similar rules that are...
CMRules: Mining Sequential Rules
Sequential rule mining is an important data mining task with wide applications. However, current algorithms for discovering sequential rules common to several sequences use very restrictive definitions of sequential rules, which make them unable to recognize that similar rules can describe the same phenomenon. This can have many undesirable effects such as (1) similar rules that are rated differe...